Cost-sensitive learning and decision making for massachusetts pip claim fraud data

نویسندگان

  • Stijn Viaene
  • Richard A. Derrig
  • Guido Dedene
چکیده

In many real-life decision making situations the default assumption of equal (mis-)classification costs underlying pattern recognition techniques is most likely violated. Consider the case of insurance claim fraud detection for which an early claim screening facility is to be built to decide upon the nature of an incoming claim as either suspicious or not. This decision typically forms the basis for routing the claim through different claims handling workflows. Claims that pass the initial (automated) screening phase are settled swiftly and routinely, involving a minimum of transaction processing costs. Claims that are flagged as suspicious need to pass a costly state verification process, involving (human) resource-intensive investigation. Here, cost-sensitive learning and decision making bring help for making cost-benefit-wise optimal decisions. In this paper we investigate the issue of cost-sensitive classification for a data set of Massachusetts closed personal injury protection (PIP) insurance claims that were previously investigated for suspicion of fraud by domain experts and for which cost information has been obtained. After a theoretical exposition on cost-sensitive learning and decision making methods, we then apply these methods to the claims data at hand to contrast the predictive performance of the documented methods for a variety of decision tree and rule learners. Standard logistic regression and (smoothed) naive Bayes are used as benchmarks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...

متن کامل

Credit Card Fraud Detection using Data mining and Statistical Methods

Due to today’s advancement in technology and businesses, fraud detection has become a critical component of financial transactions. Considering vast amounts of data in large datasets, it becomes more difficult to detect fraud transactions manually. In this research, we propose a combined method using both data mining and statistical tasks, utilizing feature selection, resampling and cost-...

متن کامل

MLP-ARD vs. Logistic Regression and C4.5 for PIP Claim Fraud Explication

In this paper we demonstrate the explicative capabilities of multilayer perceptron neural networks (MLP) with automatic relevance determination (ARD) weight regularization for personal injury protection (PIP) automobile insurance claim fraud detection. The ARD objective function hyperparameter scheme provides a means for soft input selection as it allows to determine which predictor variables a...

متن کامل

A Comparison of State-of-the-art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection

Several state-of-the-art binary classification techniques are experimentally evaluated in the context of expert automobile insurance claim fraud detection. The predictive power of logistic regression, C4.5 decision tree, k-nearest neighbor, Bayesian learning multilayer perceptron neural network, least-squares support vector machine, naive Bayes, and tree-augmented naive Bayes classification is ...

متن کامل

Fraud Detection by Stacking Cost-Sensitive Decision Trees

Worldwide, billions of euros are lost every year due to credit card fraud. Increasingly, fraud has diversified to different digital channels, including mobile and online payments, creating new challenges as innovative new fraud patterns emerge. Hence, it remains challenging to find effective methods of mitigating fraud. Existing solutions include simple if-then rules and classical machine learn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Int. J. Intell. Syst.

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2004